278 research outputs found

    A Task-based Evaluation of French Morphological Resources and Tools

    Morphology is a key component for many Language Technology applications. However, morphological relations, especially those relying on the derivation and compounding processes, are often addressed in a superficial manner. In this article, we focus on assessing the relevance of deep and motivated morphological knowledge in Natural Language Processing applications. We first describe an annotation experiment whose goal is to evaluate the role of morphology for one task, namely Question Answering (QA). We then highlight the kind of linguistic knowledge that is necessary for this particular task and propose a qualitative analysis of morphological phenomena in order to identify the morphological processes that are most relevant. Based on this study, we perform an intrinsic evaluation of existing tools and resources for French morphology, in order to quantify their coverage. Our conclusions provide helpful insights for using and building appropriate morphological resources and tools that could have a significant impact on application performance.

    Le projet RESTAURE

    The RESTAURE project (Ressources Informatisées et traitement automatique pour les langues régionales, i.e. computerized resources and automatic processing for regional languages) is funded by the ANR. It started in January 2015 and runs for 42 months. It has three main objectives: (1) acquisition and normalization of resources (corpora and lexicons); (2) development of tools for corpus acquisition and analysis; (3) dissemination of the results to the general public. The project covers three regional languages of France: Picard, Alsatian, and Occitan. Each language is represented by a partner laboratory: LESCLAP in Amiens for Picard, LiLPa in Strasbourg for Alsatian, and CLLE-ERSS in Toulouse for Occitan. A fourth laboratory in the Paris region, LIMSI-CNRS, works on the natural language processing aspects. The main motivation for the project is the lack of computerized resources for the regional languages of France, in particular for the three languages covered by the project.

    Question Generation for French: Collating Parsers and Paraphrasing Questions

    This article describes a question generation system for French. The transformation of declarative sentences into questions relies on two different syntactic parsers and named entity recognition tools. This makes it possible to further diversify the questions generated and to possibly alleviate the problems inherent to the analysis tools. The system also generates reformulations for the questions based on variations in the question words, inducing answers with different granularities, and nominalisations of action verbs. We evaluate the questions generated for sentences extracted from two different corpora: a corpus of newspaper articles used for the CLEF Question Answering evaluation campaign and a corpus of simplified online encyclopedia articles. The evaluation shows that the system is able to generate a majority of good and medium quality questions. We also present an original evaluation of the question generation system using the question analysis module of a question answering system

    Es esch fàscht wie Ditsch, oder net? Étiquetage morphosyntaxique de l'alsacien en passant par l'allemand

    Morphosyntactic analysis is a cornerstone of many natural language processing applications. It nevertheless requires substantial development effort, whether for annotating corpora or for producing lexicons and tools. For less-resourced languages, it can be useful to exploit their proximity to languages that already have tools and resources. In this article, we focus on the Alsatian dialects, which share many similarities with Standard German. We show that tools developed for German can be used to perform morphosyntactic analysis of Alsatian texts. The method consists of transposing the function words of the Alsatian texts into their Standard German equivalents. The only resource this transposition requires is a bilingual lexicon of function words.
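
    The transposition step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name, the tokenizer, and the two lexicon entries (glossed from the paper's title) are assumptions made for the example. The output token list would then be passed to an existing German tagger, which is not shown here.

```python
import re
import unicodedata

# Hypothetical mini-lexicon: Alsatian function word -> Standard German equivalent.
# These two entries are illustrative assumptions, not the paper's lexicon.
FUNCTION_WORD_LEXICON = {
    "esch": "ist",
    "net": "nicht",
}

# Simple tokenizer (an assumption): runs of word characters, or single punctuation marks.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")


def transpose_function_words(sentence, lexicon=FUNCTION_WORD_LEXICON):
    """Replace known function words by their German equivalents.

    Tokens that are not in the lexicon (content words, punctuation)
    are left unchanged, so only the function words are transposed.
    """
    # Normalize to NFC so accented letters are single code points.
    text = unicodedata.normalize("NFC", sentence)
    result = []
    for token in TOKEN_RE.findall(text):
        replacement = lexicon.get(token.lower())
        if replacement is None:
            result.append(token)
        elif token[0].isupper():
            # Preserve sentence-initial capitalization.
            result.append(replacement.capitalize())
        else:
            result.append(replacement)
    return result


tokens = transpose_function_words("Es esch fàscht wie Ditsch, oder net?")
print(" ".join(tokens))
```

    Keeping content words untouched is the point of the design: only a small closed-class lexicon is needed, and the German tagger is left to handle the remaining, often cognate, vocabulary.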

    De-Confinement in high multiplicity proton-proton collisions at LHC energies

    Recently, the CMS Collaboration has published identified particle transverse momentum spectra in high multiplicity events at LHC energies $\sqrt{s}$ = 0.9-13 TeV. In the present work the transverse momentum spectra have been analyzed in the framework of the color fields inside the clusters of overlapping strings, which are produced in high energy hadronic collisions. The non-Abelian nature is reflected in the coherence sum of the color fields which as a consequence gives rise to an enhancement of the transverse momentum and a suppression of the multiplicities relative to the non overlapping strings. The initial temperature and shear viscosity to entropy density ratio $\eta/s$ are obtained. For the higher multiplicity events at $\sqrt{s}$ = 7 and 13 TeV the initial temperature is above the universal hadronization temperature and is consistent with the creation of de-confined matter. In these small systems it can be argued that the thermalization is a consequence of the quantum tunneling through the event horizon introduced by the confining color fields, in analogy to the Hawking-Unruh effect. The small shear viscosity to entropy density ratio $\eta/s$ near the critical temperature suggests that the matter is a strongly coupled Quark Gluon Plasma.

    Approches à base de fréquences pour la simplification lexicale

    (English title: Studying frequency-based approaches to process lexical simplification.) Lexical simplification aims at replacing words or phrases with simpler equivalents. In this paper, we present three models for lexical simplification, based on different criteria that make one word simpler to read and understand than another. We tested different context sizes around the target word: no context, with a model based on word frequencies in a simplified English corpus; a few words of context, with n-gram probabilities from Web data; and an extended context, with a model based on co-occurrence frequencies. Keywords: lexical simplification, lexical frequency, language model.
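
    As a rough illustration of the context-free model mentioned in this abstract, the sketch below picks, among a target word and its candidate substitutes, the one that is most frequent in a simplified corpus. It is only a sketch under stated assumptions: the function names, the whitespace tokenization, the toy corpus, and the candidate list are all invented for the example, and the abstract does not specify how candidates are obtained or ranked.

```python
from collections import Counter


def build_frequencies(corpus_sentences):
    """Count lowercase word frequencies in a (simplified) corpus.

    Whitespace tokenization is an assumption made for this sketch.
    """
    counts = Counter()
    for sentence in corpus_sentences:
        counts.update(sentence.lower().split())
    return counts


def simplest_candidate(word, candidates, freqs):
    """Return the most frequent item among the word and its candidates.

    The original word comes first in the pool, so it is kept when no
    candidate is strictly more frequent (max returns the first maximum).
    """
    pool = [word, *candidates]
    return max(pool, key=lambda w: freqs[w.lower()])


# Toy simplified corpus (hypothetical data, for illustration only).
toy_corpus = [
    "the dog is big",
    "a big house",
    "the big car is large",
]
freqs = build_frequencies(toy_corpus)
print(simplest_candidate("enormous", ["large", "big"], freqs))
```

    Using raw frequency as a proxy for simplicity ignores context entirely, which is why the abstract contrasts this model with the n-gram and co-occurrence variants.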

    Fatal Mycobacterium tuberculosis infection despite adaptive immune response in the absence of MyD88

    Toll-like receptors (TLRs) such as TLR2 and TLR4 have been implicated in host response to mycobacterial infection. Here, mice deficient in the TLR adaptor molecule myeloid differentiation factor 88 (MyD88) were infected with Mycobacterium tuberculosis (MTB). While primary MyD88–/– macrophages and DCs are defective in TNF, IL-12, and NO production in response to mycobacterial stimulation, the upregulation of costimulatory molecules CD40 and CD86 is unaffected. Aerogenic infection of MyD88–/– mice with MTB is lethal within 4 weeks with 2 log10 higher CFU in the lung; high pulmonary levels of cytokines and chemokines; and acute, necrotic pneumonia, despite a normal T cell response with IFN-γ production to mycobacterial antigens upon ex vivo restimulation. Vaccination with Mycobacterium bovis bacillus Calmette-Guérin conferred a substantial protection in MyD88–/– mice from acute MTB infection. These data demonstrate that MyD88 signaling is dispensable to raise an acquired immune response to MTB. Nonetheless, this acquired immune response is not sufficient to compensate for the profound innate immune defect and the inability of MyD88–/– mice to control MTB infection.